Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

The race to digitize : are we forfeiting quality?

Identifieur interne : 001365 ( Main/Exploration ); précédent : 001364; suivant : 001366

The race to digitize : are we forfeiting quality?

Auteurs : Alice Keller [Royaume-Uni]

Source :

RBID : Pascal:06-0081940

Descripteurs français

English descriptors

Abstract

The article describes the errors and deficiencies found in digitized journal back issues.The results are not based on systematic or comprehensive research, but provide a snapshot of the sort of problems librarians and readers can experience when accessing digitized journals. Errors and deficiencies are classified in the following categories: failed access, inaccurate journal titles, missing elements, insufficient quality of full text images, poor accuracy of OCR (Optical Character Recognition) and inaccurate metadata. Observations of the author indicate that digitized back issues of journals vary greatly in their quality. The conclusion contains a general recommendation that all publishers who have entered the 'race for digitization' should carefully review their quality control procedures and make sure that their products are an accurate reflection of their publishing history and not fraught with errors. The author suggests that publishers and providers should develop and adhere to strict quality standards for digitized journals. Only then can libraries really consider removing print journals from their shelves.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">The race to digitize : are we forfeiting quality?</title>
<author>
<name sortKey="Keller, Alice" sort="Keller, Alice" uniqKey="Keller A" first="Alice" last="Keller">Alice Keller</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Collections Management Oxford University Library Services</s1>
<s3>GBR</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Royaume-Uni</country>
<wicri:noRegion>Collections Management Oxford University Library Services</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">06-0081940</idno>
<date when="2005">2005</date>
<idno type="stanalyst">PASCAL 06-0081940 INIST</idno>
<idno type="RBID">Pascal:06-0081940</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000411</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000376</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000372</idno>
<idno type="wicri:doubleKey">0953-0460:2005:Keller A:the:race:to</idno>
<idno type="wicri:Area/Main/Merge">001402</idno>
<idno type="wicri:Area/Main/Curation">001365</idno>
<idno type="wicri:Area/Main/Exploration">001365</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">The race to digitize : are we forfeiting quality?</title>
<author>
<name sortKey="Keller, Alice" sort="Keller, Alice" uniqKey="Keller A" first="Alice" last="Keller">Alice Keller</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Collections Management Oxford University Library Services</s1>
<s3>GBR</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Royaume-Uni</country>
<wicri:noRegion>Collections Management Oxford University Library Services</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Serials : (United Kingdom Serials Group)</title>
<title level="j" type="abbreviated">Serials : (U.K. Ser. Group)</title>
<idno type="ISSN">0953-0460</idno>
<imprint>
<date when="2005">2005</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Serials : (United Kingdom Serials Group)</title>
<title level="j" type="abbreviated">Serials : (U.K. Ser. Group)</title>
<idno type="ISSN">0953-0460</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Digitizing</term>
<term>Electronic periodical</term>
<term>Error</term>
<term>Information quality</term>
<term>Quality control</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Numérisation</term>
<term>Périodique électronique</term>
<term>Qualité information</term>
<term>Erreur</term>
<term>Contrôle qualité</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Numérisation</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">The article describes the errors and deficiencies found in digitized journal back issues.The results are not based on systematic or comprehensive research, but provide a snapshot of the sort of problems librarians and readers can experience when accessing digitized journals. Errors and deficiencies are classified in the following categories: failed access, inaccurate journal titles, missing elements, insufficient quality of full text images, poor accuracy of OCR (Optical Character Recognition) and inaccurate metadata. Observations of the author indicate that digitized back issues of journals vary greatly in their quality. The conclusion contains a general recommendation that all publishers who have entered the 'race for digitization' should carefully review their quality control procedures and make sure that their products are an accurate reflection of their publishing history and not fraught with errors. The author suggests that publishers and providers should develop and adhere to strict quality standards for digitized journals. Only then can libraries really consider removing print journals from their shelves.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Royaume-Uni</li>
</country>
</list>
<tree>
<country name="Royaume-Uni">
<noRegion>
<name sortKey="Keller, Alice" sort="Keller, Alice" uniqKey="Keller A" first="Alice" last="Keller">Alice Keller</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001365 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001365 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:06-0081940
   |texte=   The race to digitize : are we forfeiting quality?
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024